A new bioconcentration factor model based on SMILES and indices of presence of atoms.
نویسندگان
چکیده
Indices of the presence of atoms (IPA) encode the presence or absence of atoms, such as nitrogen, oxygen, sulphur, phosphorus, fluorine, chlorine, and bromine in a molecule. They are calculated with the simplified molecular input line entry system (SMILES). Using the Monte Carlo method for correlation weights of these indices, one can improve the predictive ability of optimal SMILES-based descriptors in quantitative structure-activity relationships (QSAR) for bioconcentration factor. The model without IPA gave the following results: n=503, r(2)=0.6803, q(2)=0.6781, s=0.759, F=1066 (subtraining set); n=322, r(2)=0.8181, r(pred)(2)=0.8159, s=0.565, F=1439 (calibration set); n=105, r(2)=0.6703, r(pred)(2)=0.6577, R(m)(2)=0.6628, s=0.728, F=209 (test set); n=106, r(2)=0.6624, r(pred)(2)=0.6502, R(m)(2)=0.6212, s=0.757, F=204 (validation set) The model with IPA gave: n=503, r(2)=0.7082, q(2)=0.7062, s=0.725, F=1216 (subtraining set); n=322, r(2)=0.8401, r(pred)(2)=0.8383, s=0.528, F=1682 (calibration set); n=105, r(2)=0.7489, r(pred)(2)=0.7402, R(m)(2)=0.7252, s=0.637, F=307 (test set); n=106, r(2)=0.7306, r(pred)(2)=0.7217, R(m)(2)=0.7010, s=0.680, F=282 (validation set).
منابع مشابه
CORAL: building up the model for bioconcentration factor and defining it's applicability domain.
CORAL (CORrelation And Logic) software can be used to build up the quantitative structure--property/activity relationships (QSPR/QSAR) with optimal descriptors calculated with the simplified molecular input line entry system (SMILES). We used CORAL to evaluate the applicability domain of the QSAR models, taking a model of bioconcentration factor (logBCF) as example. This model's based on a larg...
متن کاملToxicity and Bioconcentration of Cadmium and Copper in Artemia Urmiana Nauplii
Background: Artemia urmiana are small crustaceans that because of its non-selective filter feeder pattern potentially may absorb high level of heavy metals through their living environment. In this study, the effects of different levels of cadmium and copper on survival, catalase activity and metals bioconcentration rates in A. urmiana nauplii have been investigated. Methods: The research wa...
متن کاملAttitudes towards English Language Norms in the Expanding Circle: Development and Validation of a new Model and Questionnaire
This paper describes the development and validation of a new model and questionnaire to measure Iranian English as a foreign language learners’ attitudes towards the use of native versus non-native English language norms. Based on a comprehensive review of the related literature and interviews with domain experts, five factors were identified. A draft version of a questionnaire based on those f...
متن کاملVoice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- European journal of medicinal chemistry
دوره 45 9 شماره
صفحات -
تاریخ انتشار 2010